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Abstract. We introduce the notion of doubly rooted plane trees and give a 
decomposition of these trees, called the butterfly decomposition which turns out 
to have many applications. From the butterfly decomposition we obtain a one- 
to-one correspondence between doubly rooted plane trees and free Dyck paths, 
which implies a simple derivation of a relation between the Catalan numbers and 
the central binomial coefficients. We also establish a one-to-one correspondence 
between leaf-colored doubly rooted plane trees and free Schroder paths. The 
classical Chung-Feller theorem on free Dyck paths and some generalizations and 
variations with respect to Dyck paths and Schroder paths with flaws turn out 
to be immediate consequences of the butterfly decomposition and the preorder 
traversal of plane trees. We obtain two involutions on free Dyck paths and free 
Schroder paths, leading to two combinatorial identities. We also use the butterfly 
decomposition to give a combinatorial treatment of the generating function for the 
number of chains in plane trees due to Klazar. We further study the average size 
of chains in plane trees with n edges and show that this number asymptotically 
tends to 
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1 Introduction 

This paper is concerned with the enumeration of plane trees and the number of 
chains in plane trees with n edges. Although this subject has been very well 
studied over many decades, it seems that interesting problems still emerge. As 
we shall see, the enumeration of chains in plane trees leads us to discover a 
fundamental property of doubly rooted plane trees which has many applications. 
We call this the butterfly decomposition. 

From the butterfly decomposition, we can establish a correspondence between 
doubly rooted plane trees and free Dyck paths. So we immediately get the re- 
lation between the Catalan numbers and the central binomial coefficients. The 
butterfly decomposition also implies the classical Chung-Feller theorem on free 



Dyck paths with a given number of steps under x-axis. The Chung-Feller theorem 
was first proved by Major Percy A. MacMahon in 1909 ^1 p. 168] but named 
after its 1949 re- discoverers |4. MacMahon proved it using formal series of words 
on an alphabet; Chung and Feller used generating functions. The previous combi- 
natorial approaches to the Chung- Feller theorem are based on the cycle lemma or 
cyclic paths, see Dershowitz-Zaks jH] and Narayana There are other Chung- 
Feller type results and generalizations in [3 O IHl CHI CHI 1221 ■ some sense, the 
butterfly decomposition can be regarded as labelled variation of the cycle lemma. 

The butterfly decomposition also leads to the following results: a correspon- 
dence between leaf-colored doubly rooted plane trees and free Schroder paths, 
a simple bijection between leaf-colored plane trees and Schroder paths, and a 
combinatorial interpretation of the generating function for the number of chains 
in plane trees obtained by Klazar ^S]. We show that there is a one-to-one corre- 
spondence between chains in plane trees and tricolored plane trees (the definition 
is given in Section 5). 

We obtain two involutions on free Dyck paths and free Schroder paths, which 
lead to two combinatorial identities. The last section of this paper gives an 
asymptotic formula for the average size of chains in plane trees with n edges. 



In this section, we introduce the notion of doubly rooted plane trees and their 
butterfly decomposition. This decomposition seems to be fundamental for the 
enumeration of plane trees. It also implies the generating function for the number 
of chains in plane trees obtained by Klazar We will study the enumeration 
of chains in Section 5. The main result of this section is a correspondence be- 
tween doubly rooted plane trees and free Dyck paths, from which it follows a 
combinatorial interpretation of the relation 



We will also establish a correspondence between free Dyck paths and 2-colored 
plane trees. 

A (rooted) plane tree T with a distinguished vertex w is called a doubly rooted 
plane tree, where the distinguished vertex is regarded as the second root. The 
butterfly decomposition of a doubly rooted plane tree T with a distinguished vertex 
w is described as follows. Let P = V1V2 ■ ■ ■ , VkW be the path from the root of T 
to w. Let Li, L2, . . . , be the subtrees such that Lj consists of the vertex Vi and 
its descendants on the left hand side of the path P. Similarly, we can define the 
subtrees Ri, R2, . . . , Rk as the subtrees rooted at vi,V2, ■ ■ ■ ,Vk consisting of the 
descendants on the right hand side of P. Moreover, the subtree of T rooted at 
w is denoted by T'. Therefore, a plane tree T with a distinguished vertex w can 
be decomposed into smaller structures {Ui, U2, ■ ■ ■ , Uk, T'), where Ui is called a 
butterfly consisting of Lj and Ri and the edge in the middle, as shown in Figure [H 



2 The Butterfly Decomposition 




(2.1) 




Figure 1: A Butterfly 



Let B and C be the generating functions of the central binomial coefficients 
and Catalan numbers respectively: 

* n=0 ^ ' 



l_yr3S^^/2,A (2.3) 

2x ^ n + l\n J ' ^ ^ 

where (^") is called the central binomial coefficient, and c„ = (^") is the n-th 
Catalan number . It is clear that the generating function for a butterfly with 
n edges equals xC^, and the generating function for a sequence of k butterflies 
with a total number of n edges equals {xC'^Y. Note that the number of doubly 
rooted plane trees with n edges equals n + 1 times the Catalan number, that is, 
the central binomial coefficient (^^) . Thus, we arrive at the following generating 
function relation: 

B = C + C{xC^) + C{xC^f + ■■■ = — (2.4) 

1 — xC'^ 



A natural question arises: is there a simple combinatorial argument that leads 
to this conclusion without resorting to the formula for the Catalan numbers? The 
answer is affirmative, this leads to a quite simple derivation of the relation (j2.H) . 

Recall that a Dyck path of length 2n is a lattice path from the origin to (2n, 0) 
consisting of up steps (1, 1) and down steps (1,-1) that does not go below the 
X-axis. An elevated Dyck path or an irreducible Dyck path is defined as a Dyck 
path that does not touch the x-axis except for the origin and the destination. A 
lattice path from the origin to (2n, 0) using the steps (1, 1) and (1, —1) without 
additional restrictions is called a free Dyck path, a free Dyck path is also called a 
Dyck path with flaws in the sense that the segments below the x-axis are regarded 
as flaws, see Eu-Liu-Yeh [TU]. The reflection of a Dyck path with respect to the 
X-axis is called a negative Dyck path. An elevated (irreducible) negative Dyck 
path is defined in the same manner. As we shall see, free Dyck paths can be 
regarded as a labelled version of Dyck paths. Clearly, the set of free Dyck paths 
of length 2n is just the set of sequences consisting of n up steps and n down steps, 
as counted by the central binomial coefficient (^^) . 



Theorem 2.1 There is a bisection between the set of doubly rooted plane trees 
with n edges and the set of free Dyck paths of length 2n. 



First we give a combinatorial setting for the proof of the above theorem. We 
recall the classical glove bijection between plane trees and Dyck paths This 
correspondence is also referred to as the preorder traversal of a plane tree. For the 
purpose of this paper, we may view the glove bijection as a recursive procedure. 
Recall that a planted plane tree is a plane tree whose root has only one child. 
Then the glove bijection gives a correspondence between the set of planted plane 
trees with n edges and the set of elevated Dyck paths of length 2n. A planted 
plane tree with one edge corresponds to the elevated Dyck path of length two. Let 
T be a planted plane tree, and let Ti, T2, . . . , be the subtrees of the only one 
child of the root of T. Let Pi, P2, ■ ■ ■ ,Pk be elevated Dyck paths corresponding 
to Ti, T2, . . . , Tfc respectively. Then UPiP2 - ■ ■ PkD is an elevated Dyck path of 
length 2?2, where U stands for an up step and D stands for a down step. 

We are now ready to give a proof of Theorem 12.11 

Proof. Let T be a doubly rooted plane tree with n edges. Let w be the distin- 
guished vertex of T and let V1V2 ■ ■ ■ v^w be the path from the root to w. Suppose 
that (Li, L2, R2] ■ ■ ■ L^, i?^; T') is the butterfly decomposition of T. 

For the Li {1 < i < k) and T', we use the glove bijection to them and call 
the resulting Dyck paths Pi and Pk+i- For every we first add an edge at each 
root to form a planted plane tree Tj, then use the glove bijection to produce a 
negative elevated Dyck path Qi. Now 

-Pi Q1P2Q2 ■ • • Pk Qk Pk+i (2.5) 

is a free Dyck path of length 2n. Conversely, given a free Dyck path we may de- 
compose it into elevated (irreducible) segments like the first return decomposition 
of a Dyck path jTj , and we may reverse the above procedure to construct a doubly 
rooted plane tree because any free Dyck path P has a unique decomposition in 
the form ()2.5j) such that Qi,Q2, ■ ■ ■ ,Qk are negative elevated Dyck paths and 
Pi, P2, . . . , Pfc+i are the usual Dyck paths with the empty paths allowed. Thus 
we have established the bijection. I 

An example of the above bijection is shown in Figure |21 

We next give another interpretation of the generating function for the number 
of bicolored plane trees. Guided by the following generating function identity 

C 1 

we are led to introduce the notion of bicolored plane trees and fc-colored plane 
trees, in general. A k-colored plane tree is a plane tree in which the children of the 
root are colored with k colors. A 2-colored plane tree is called a bicolored plane 
tree, and a 3-colored plane tree is called a tricolored plane tree. For bicolored 
plane trees, we assume that the two colors are black and white. Note that this 
terminology is somewhat misleading because in our context only the children of 
the root are colored. The relation ()2.6p indicates that the set of bicolored plane 
trees are in one-to-one correspondence with doubly rooted plane trees. We next 
establish such a correspondence by making a connection between bicolored plane 
trees and free Dyck paths. 




Figure 2: Doubly rooted plane trees and free Dyck paths 

Theorem 2.2 There is a one-to-one correspondence between the set of bicolored 
plane trees with n edges and the set of free Dyck paths of length 2n. 

Proof. Let T be a bicolored plane tree, and let Ti,T2, . . . ,Tk be the planted 
subtrees of the root of T, listed from left to right. If T, inherits the black color, 
then we construct an negative elevated Dyck path Pi from Tf, otherwise we con- 
struct an elevated Dyck path Pj above the x-axis. So we get a free Dyck path 
P1P2 ■ ■ ■ Pk- Conversely, given a free Dyck path we may construct a bicolored 
plane tree. Hence we obtain the desired bijection. I 

The bijections in Theorems 12.11 and 12.21 lead to a bijection between doubly 
rooted plane trees and bicolored plane trees. In fact, we may establish a direct 
correspondence without resorting to free Dyck paths. 

Theorem 2.3 There is a bijection between the set of doubly rooted plane trees 
with n edges and the set of bicolored plane trees with n edges. 

Proof. By the butterfly decomposition in Theorem 12.11 we get subtrees Lj, Tj 
and T' . By coloring Li and T' black while Tj white, and identifying their roots 
as the root of the corresponding bicolored plane tree, we have its subtrees listed 
from left to right as 

Li Ti L2T2 ■ ■ ■ Lk Tk T' . 

The reverse procedure is easy to construct. Thus we have established the bijec- 
tion. I 



3 The Chung-Feller Theorem 



We begin this section by pointing out that the classical Chung-Feller theorem 
on Dyck paths is an immediate consequence of our bijection between doubly 



rooted plane trees and free Dyck paths. To see this connection, one only needs 
a simple observation on the preorder traversal of a plane tree. We also use this 
idea to derive some refinements and generalizations of the Chung-Feller theorem, 
including some recent results of Eu, Fu and Yeh [9^ on Dyck paths and Schrooder 
paths with flaws. 

Theorem 3.1 (Chung- Feller) For any < m < n, the number of free Dyck 
paths of length 2n that contain exactly 2m steps below the x-axis is independent 
of m, and is equal to the n-th Catalan number Cn- 

Using the butterfly decomposition, we may transform the Chung-Feller theo- 
rem to an equivalent form on plane trees, which turns out to be a simple prop- 
erty of the preorder traversal. To be precise, we define the right-to-left preorder 
traversal of a plane tree T as a recursive procedure. First, visit the root of T. 
Let Ti, T2, . . . , Tfc be the subtrees of the root of T listed from left to right. Then 
traverse the subtrees in the order of Tk,Tk-i, . . . ,Ti. From the above traversal 
procedure, we may label the vertices of T with the numbers 0, 1, 2, . . . , n in the 
order that they are visited. Figure El gives the plane tree corresponding to the 
free Dyck path in Figure El and the labels by the right-to-left preorder traversal. 








Figure 3: Labels for the Chung-Feller theorem 

The following property immediately implies the Chung-Feller theorem since 
any plane tree can be regarded as a doubly rooted plane tree in which the distin- 
guished vertex is chosen as the vertex with a given label m with respect to the 
right-to-left preorder traversal. 

Theorem 3.2 Let T be a plane tree with n edges. Assume that the vertices ofT 
are labelled by 0, 1, 2, . . . , according to the right-to-left preorder traversal. Let 
w be the vertex labelled by m, where m is a given number not exceeding n. Then 
the doubly rooted plane tree T with w being the distinguished vertex corresponds 
to a free Dyck path with m down steps (up steps) below the x-axis. 

As another corollary, we note that half of all free Dyck paths end with an up 
step. Thus over all plane trees with n edges, half of the vertices are leafs, see 
Problem 10753 of the American Mathematical Monthly [T^ I16j. 

The above interpretation of the Chung-Feller theorem also implies some re- 
finements and generalizations recently obtained by Eu, Fu and Yeh Let us 



define some terminology. We say that a free Dyck patli lias m flaws if it contains 
m up (or down) steps below the x-axis. We note that a negative elevated (irre- 
ducible) Dyck path is called a flaw block by Eu, Fu and Yeh jH] . We define the 
stem of a doubly rooted plane tree as the path from the root to the distinguished 
vertex. Let T be a doubly rooted plane tree with a distinguished vertex w. An 
edge of T is said to be a prefix edge if it is either on on the stem of T or to 
the right of the stem. In other words, a prefix edge is an edge with labels not 
exceeding the label of the distinguished vertex with respect to the right-to-left 
preorder traversal. An example is shown in Figure El where the prefix edges are 
drawn with thick edges. 

Using the preorder traversal of plane trees, we get the following generalization 
of the refined version of the Chung- Feller theorem jH] . 

Theorem 3.3 For < k < m < n, there is a bijection between the set of free 
Dyck path of length 2n with m flaws in k flaw blocks and the set of doubly rooted 
plane trees of n edges with stem size k and m prefix edges. 

Proof. From the butterfly decomposition and the correspondence in Theorem 12. 1|, 
we see that the number of flaw blocks in a free Dyck path equals the stem size 
of the corresponding doubly rooted plane tree, and the number of flaws in a free 
Dyck path equals the number of prefix edges in the plane tree. This completes 
the proof. I 

By the butterfly decomposition, one sees that the generating function for 
doubly rooted plane trees with stem size k equals x^C'^ ■ C^~^^. It follows that 
the number of such trees with n edges and m prefix edges equals [x"^]x''C'' ■ 
|-^n-mj^fc+i^ where [^"jC'^ is the usual notation for the coefficient of x^ in the 
expansion of C'^. By the Lagrange inversion formula |2I], we have 



Thus, we obtain the following expression. 

Corollary 3.4 For < k < m < n, the number of free Dyck paths of length 2n 
with m flaws and k flaw blocks equals 



Setting m = n in the above corollary, one gets the number of Dyck paths of 
length 2n with k returns obtained by Deutsch [7]: 



We next consider the enumeration of Schroder paths with flaws. For this 
purpose, we need to introduce the notion of leaf-colored doubly rooted plane trees 




(3.1) 





(3.2) 



which are defined as doubly rooted plane trees whose leaves are colored with two 
colors red (R) and blue (B) under the convention that the distinguished vertex 
receives no color even if it is a leaf. An edge of a plane tree is called an external 
edge if it's end vertex is a leaf; otherwise it is called an internal edge. As we shall 
see, such leaf-colored doubly rooted plane trees are in one-to-one correspondence 
with free Schroder paths. We note that there is a bijection between Schroder 
paths and plane trees with every leaf being colored red or blue, see Gouyou- 
Beauchamps and Vauquelin here we create a new one. 

Recall that a Schroder path of length 2n is a lattice path in the plane from (0, 0) 
to (2ra, 0) with up steps U = (1, 1), horizontal steps H = (2, 0), and down steps 
D = (1,-1), that never go below the x-axis. These paths are enumerated by the 
Schroder numbers r„ [21]. An elevated (irreducible) Schroder path and a negative 
Schroder path are defined in the same manner as with Dyck paths. A lattice path 
from (0, 0) to (2n, 0) with steps U = (1,1), H = (2, 0), and D = (1, -1) without 
additional restrictions is called a free Schroder path. We say that a free Schroder 
path has m flaws if the number of U steps and H steps under the x-axis equals 
m. A flaw block of a Schroder path is defined as a negative elevated Schroder 
path. 

By the preorder traversal, we obtain the following correspondence. 

Theorem 3.5 There is a one-to-one correspondence between the set of plane 
trees with n edges in which each leaf is colored red or blue and the set of Schroder 
paths of length 2n. 

Proof. Let T be a plane tree with n edges in which each leaf is colored red or 
blue. We proceed to construct a Schroder path of length 2n from the (left-to- 
right) preorder traversal. In the preorder traversal of the vertices of T, each 
edge is visited twice. Note that when an external edge e = (m, v) {v is a leaf) is 
traversed, one always visits the vertex u, then the leaf v, and then immediately 
goes back to the vertex u. Now we may generate a sequence of U, D, and H 
steps by the following rule: (1) When an internal edge is visited for the first time, 
we get an U step. (2) When an internal edge is visited for the second time, we 
get a D step. (3) When an external edge with a red leaf is traversed, we get two 
steps UD. (4) When an external edge with a blue leaf is traversed, we get an H 
step. It is easy to see that we obtain a Schroder path of length 2n and the above 
procedure is reversible. I 

By using the butterfiy decomposition, we obtain the following correspondence. 

Theorem 3.6 There is a bijection between the set of leaf-colored doubly rooted 
plane trees with n edges and the set of free Schroder paths of length 2n. 

Proof. Similar to that of Theorem 12.11 

Recall that the number of plane trees with n edges and i leaves is given by 
the Narayana number [21] 




It follows that that the number of leaf-colored doubly rooted plane trees equals 



n n 




On the other hand, it is easy to see that the number of free Schroder paths of 
length 2n is given by the summation 



By the right-to-left preorder traversal and the above correspondence, one may 
determine a distinguished vertex of a plane tree whose leaves are colored red and 
blue. This fact can be restated as a Schroder path analogue of the Chung-Feller 
theorem obtained by Eu, Fu and Yeh P . 

Theorem 3.7 For each Schroder path P from (0,0) to (2n,0), assign weight 2 
to P if P ends with a U step; otherwise P is assigned weight 1. Let m be a given 
number not exceeding n. Then the total weight of the set of free Schroder paths 
of length 2n with m flaws is always the Schroder number Tn- 

If a free Schroder path ends with an up step, then the corresponding subtree 
T' is empty and we have that the distinguished vertex is a leaf. There are now 
two possible ways to color it, hence we assign weight 2 to this kind of Schroder 
paths. 

Using plane trees, we may reinterpret the above theorem as follows. 

Theorem 3.8 Let T a plane tree with n edges. Assume that the vertices of T 
are labelled by 0, 1, 2, . . . , n according to the right-to-left preorder traversal. Let 
w he a vertex labelled by m. Let T' be a leaf-colored doubly rooted plane tree T 
with w being the distinguished vertex. Then by the correspondence between leaf- 
colored doubly rooted plane trees and free Schroder paths, T' corresponds to a free 
Schroder path with m flaws. 

From the above theorem, we immediately get the following refinement of Eu, 
Fu and Yeh |9j. 

Theorem 3.9 For < k < m < n, there is a bijection between the set of free 
Schroder path of length 2n with m flaws in k flaw blocks and the set of leaf-colored 
doubly rooted plane trees of n edges with stem size k and m prefix edges. 




Hence Theorem 13 . 61 vields the following identity: 




(3.3) 




Figure 4: Leaf-colored plane trees and free Schroder paths 

An example of the above bijection between leaf-colored doubly rooted plane 
trees and free Schroder paths is illustrated in Figure El 

To conclude this section, we use the butterfly decomposition to obtain a for- 
mula for the total weight of leaf-colored doubly rooted plane trees of n edges with 
stem size k and m prefix edges. Let S be the generating function of the Schroder 
numbers as given by the equation 5* = 1 + xS + xS'^. Then the total weight of 
leaf-colored doubly rooted plane trees of n edges with stem size k and m prefix 
edges equals 

2 ■ [x^Jx'^S^- ■ + [x"'']x''S'' ■ - 1) 

which can be rewritten as [x"^~^]S''[x"-^"^]{S''^^ + S^). Let 

a(n,A;) = (3.4) 

Set a{0,k) = 1. When n > 1, using the Lagrange inversion formula [21], we 
obtain that 




Note that a(n, 1) reduces to the Schroder number r„. 

Corollary 3.10 For < k < m < n, the total weight of free Schroder paths of 
length 2n with m flaws and k flaw blocks equals 

a{m — k,k) ■ [a{n — m, k + 1) + a{n — m, k)]. 

4 Two Involutions 

In this section, we present two parity reversing involutions on free Dyck paths 
and free Schroder paths, where the parity is defined as the parity of the number 



of its flaw blocks. We also derive two identities based on the computation via the 
butterfly decomposition. 



Theorem 4.1 For n > 1, there is a parity reversing involution on the set of free 
Dyck paths of length 2n, which leads to the following identity 

1=0 ' \ / 



Proof. Let P be a free Dyck path of length 2n. Let be the desired involution. If 
P ends with an elevated Dyck path, then we construct (f){P) by reflecting the last 
elevated Dyck path with respect to the x-axis. If P ends with a negative elevated 
Dyck path, then (f){P) is obtained by reflecting the last negative elevated Dyck 
path with respect to the x-axis. Clearly, is a parity reversing involution. By the 
butterfly decomposition, the number of free Dyck paths with i flaw blocks equals 
the number of doubly rooted plane trees with stem size i, that is, [a;"~*]C^*"'"^. 
Hence the relation 1)4.11) follows from ()3.ip . I 

We also have an involution on free Schroder paths. 



Theorem 4.2 For n > 1, there is a parity reversing involution on the set of 
free Schroder paths of length 2n containing at least one up step. So we have the 
following identity on a{n, k) as defined by \S.,^) : 

n 

^(-l)*a(n-i,22 + 1) = 1. (4.2) 

i=0 



Proof. Let P be a free Schroder path which contains at least one up step. Let 
Q be the last segment of P which is an elevated Schroder path or a negative 
elevated Schroder path. Note that Q may be followed by some horizontal steps in 
P. We reflect Q with respect to the x-axis to get a free Schroder path. Clearly, 
the resulting path contains at least one up step. It is easy to see that this 
construction is reversible and parity reversing. By the correspondence given in 
Theorem 13. 9| the number of free Schroder paths of length 2n with i flaw blocks 
equals [x"~*]S'^*^"^, that is, a{n—i, 2^+1). Therefore, identity ()4.2|) follows from the 
involution and the fact that the only Schroder path not affected by the involution 
is the path consisting of only horizontal steps. I 



5 Chains in Plane Trees 



Let us recall that a chain of a plane tree is a selection of vertices on a path from 
the root to a leaf. The size of a chain is defined as the number of vertices in 
the chain. Let Qn be the number of nonempty chains in all plane trees with n 
edges. A tree with n edges may have as many as 2"^^ — 1 non-empty chains and 
as few as 2n + 1. The twelve chains in plane trees with 2 edges are illustrated in 



Figure |31 where those empty circles stand for vertices in chains and black circles 
stand for normal vertices. For instance, the last structure of Figure El has a chain 
of size 2. 




Figure 5: Chains in plane trees with 2 edges 

The main result of this section is a combinatorial interpretation of the gen- 
erating function for the number of chains in plane trees obtained by Klazar ^3] • 
We also obtain a one-to-one correspondence between the set of chains in plane 
trees with n edges and the set of tricolored plane trees with n edges. Klazar 
derived the following generating function for the number of chains in plane trees 
with n edges: 

C 

-— = 1 + 3a; + 12x^ + 51x^ + 222x^ + 978x^ + ■ ■ ■ . (5.1) 

Note that here we use a slightly different formulation of the generating function 
C from that used by Klazar [T!^ . 

We now give a combinatorial proof of the fact that the generating function of 
chains in plane trees with n edges equals jz^^- Let T be a plane tree and Q be 
a chain of T. Suppose w is the vertex in Q such that the path f if 2 ■ ■ ■ VkW from 
the root of T to w contains all the vertices in Q. Moreover, we color the vertex 
Vi with the white color if it belongs to Q; otherwise, we color Vi with the black 
color. Such a coloring scheme leads to the following bijection. 



Theorem 5.1 There is a one-to-one correspondence between the set of chains 
in plane trees with n edges and the set of doubly rooted plane trees in which the 
vertices on the path from the root to the distinguished vertex (but not including 
the distinguished vertex) are colored with two colors. 



Using the above theorem and the butterfly decomposition of doubly rooted 
plane trees, we obtain the generating function of Klazar. 

Motivated by the following relation 

CI , , 

(5.2) 



1 - 2xC2 1 - 3xC 
we are led to establish the following bijection. 



Theorem 5.2 There is a one-to-one correspondence between chains in plane 
trees and tricolored plane trees. 



Proof. Let T be a plane tree and Q be a chain of T. Let V1V2 ■ ■ ■ VkW be the path 
from the root to the vertex w, where w is the last vertex in the chain. Suppose 
that (Li, L2, R2', • • • ; -Z^fc, Rk] T') is the butterfly decomposition of T. Let % 
be the planted plane tree obtained from Ri by adding a root. Coloring Li and T' 
red, and color Tj white if the vertex Vi contained in Tj is a chain vertex, otherwise 
color Tj black. Identify their roots as the root of the corresponding tricolored 
plane tree, and set the subtrees of the root as 

Li Ti L2T2 ■ ■ ■ Lk Tfc T' . 

The reverse procedure is easy to construct. This completes the proof. I 
An example of the above bijection is shown in Figure IHl 




Figure 6: Chains and tricolored plane trees 



From the above bijection, we easily see that chains with m vertices corre- 
spond to tricolored trees with m — 1 white subtrees. Hence as a special case of 
Theorem 15. 2| we obtain Theorem 12. HI 

Notice that a chain in plane trees is just a two colored path in the butterfly 
decomposition. Hence we can color the vertices in chain with t colors and preserve 
these colors in the above bijection. Precisely speaking, a chain is called t-colored 
if its elements are t-colored. We have the following bijection. 



Theorem 5.3 There is a one-to-one correspondence between the set of {k — 2)- 
colored chains in plane trees with n edges and the set of k- colored plane trees with 
n edges. 



The above bijection is a reflection of the following Catalan type identity 

C _ 1 

l-{k- l)xC2 ~ 1 - kxC 



Remark. By the path decomposition, the generating function for the number of 
chains with n edges that end with a leaf equals 

1 1 + a/I - 4X 9-^4^ 



1 - 2xC2 3v/r^4^ - 1 



It is a new combinatorial explanation for Sequence A067336 in [TH] . 



6 Average Size of Chains 



In this section, we use the generating function B of central binomial coefficients 
as given by ()2.2|1 to study the total size and average size of chains in plane trees 
with n edges. It turns out that by a decomposition of chains we may rewrite 
i_2xc'^ order to give an asymptotic formula. We show that the average size of 
chains in plane trees with n edges asymptotically tends to 

Bearing in mind that the generating function for the number of chains of size 
1 equals the generating function B of the central binomial coefficients. We let 
L* be the generating function for the number of plane trees with a distinguished 
leaf. Any tree with a distinguished vertex can be decomposed into a tree with 
a distinguished leaf and a subtree rooted at the distinguished vertex. Thus we 
have B = L*C and L* = B/C. 

We now consider plane trees with at least two vertices in which there is a 
distinguished leaf. Let L be the generating function of such plane trees with n 
edges. It is easy to obtain the following relations 

L = L--l = ^ = ^. 

C 2 

Property 6.1 The generating function for the total number of chains of size k 
in plane trees with n edges equals B ■ {^y^) 

Proof. The required generating function follows from a decomposition procedure 
for a plane tree with a given chain. Let T be a plane tree and Q be a chain of 
T. Let Wi,W2,...,Wk be the chain vertices on the path from the root to the last 
vertex Wk- Then T can be decomposed into k + 1 plane trees Ti, T2, . . . , T^, and 
T', where Ti is constructed from T by cutting off the subtrees of Wi, T2 is obtained 
from the subtree of T rooted at Wi by cutting off the subtrees of W2, and so on, 
finally T' is the subtree of T rooted at Wk- The vertices Wi,W2,---,Wk serve as 
distinguished vertices in Ti, T2, . . . , T^. The generating function for the structure 
of Ti equals L* = B/C, since the distinguished vertex is allowed to coincide with 
the root in Ti. The generating function for other Ti {2 < i < k) equals L and 
the generating function for T' equals C. Hence the required generating function 
equals L* ■ L''-^ ■ C = B ■ {^)''~\ I 

An interesting case arises if we look at chains of size 3 that include both 
the root and a leaf. In this case we have as our generating function. It is 
easily shown that = + 6x^ + 29x'^ + 130x^ + ■ ■ ■ . This ubiquitous sequence, 
A008549, also counts dH: 

• The area under all Dyck paths of length 2n — 2. 

• The number of points at height one over all binomial paths of length 2n — 2. 



The number of inversions among all 321-avoiding permutations in S, 



From the above theorem, we have the following generating function for the 
total number of chains in all plane trees with n edges. 



Theorem 6.2 The generating function for the total number of nonempty chains 



Proof. We sum over k to get B ■ J2k>o = B ■ {l - ^) ^ = I 



Now we consider the asymptotic approximations. Let be the total number 
of chains in plane trees with n edges. Klazar |13j has shown that 



Now we use the language of Riordan arrays [131211] to compute the generating 
function for the total size of chains in all plane trees with n edges. The idea of 
Riordan arrays is represented as follows. Given two generating functions g{x) = 

l + gix + g2X^ H and /(x) = fiX + f2X^ H with /i 7^ 0, let M = {rnij)ij>o 

be the infinite lower triangular matrix with nonzero entries on the main diagonal, 
where rriij = [x^]{g{x)f^{x)), namely, rriij equals the coefficient of in the 
expansion of the series g{x)f^{x). If an infinite lower triangular matrix M can 
be constructed in this way from two generating functions g{x) and f{x), then it 
is called a Riordan array and is denoted by M = {g{x), f{x)) = {g, f)- 

If we multiply the matrix M = {g, f) by a column vector (oq, ai, ■ ■ ■ )^ to get 
a column vector (60, ■ ■ ■ )"^, then the generating functions A[x) and R{x) of the 
sequences (oq, ai, ■ ■ ■ ) and (60, ■ ■ ■ ) satisfy the following relation 



We now have the following generating function for the total size of chains in 
plane trees with n edges. 

Theorem 6.3 The generating function for the total size of all chains in plane 
trees with n edges equals 77^^- 

Proof. Let g{x) = B be the generating function for chains of size 1, and f{x) = 
L = be the generating function for plane trees with at least two vertices 
and a distinguished leaf. Consider the Riordan matrix {B,L). The generating 
function of the j-th (j > 1) column is BL^^~^\ which is the generating function 
for the number of chains of size j. Since the generating function of (1, 2, 3, 4 ■ ■ ■ )^ 
is A{x) = (jT^, it follows that the multiplication of the Riordan matrix {B,L) 
and the column vector (1, 2, 3, 4 ■ ■ ■ )^ gives the sequence of the total size of chains 
in plane trees with n edges. It follows that 



in all plane trees with n edges equals gire- 




(6.1) 



R{x)=g{x)A{f{x)). 



R{x) = g{x)AU{x)) 



B AB 



(1-L)2 {3 -By 



is the generating function for the total size of chains. The matrix identity can be 

stated as 



1 










"l" 




" 1 " 


2 


1 








2 




4 


6 


5 


1 






3 




19 


20 


22 


8 


1 




4 




92 


70 


93 


47 


11 1 




5 




446 



This completes the proof. 

After some algebraic calculations we get 



5-18x 



R 



= + 3 



4x 



8 



(1 - fx) 



Recall that Bender's lemma ^1 p. 496] basically says that if C (x) = A (x) B (x) 
and the radii of convergence for A{x) and B{x) are a and (3 with a < (3, then 



Let A ix) 



Cn ~ A^B (a) 

and B{x) = \- + 3 ) . We have a = 2/9 < /5 = 1/4 



for Bender's lemma. So we have B (|) = | while A^ = (|)" ^ ■ So we obtain 
the following asymptotic property. 



Theorem 6.4 Let R^ be the total size of chains in all plane trees with n edges. 
Then we have 



Rn 



n + 9 /9 
12 V 2 



(6.2) 



From Klazar's formula ()6.H) and the above formula ()6.2|) it follows that the average 
size of chains in planes trees with n edges approaches 



Rn 

hm — 



n + 9 
6 



For example, 

i?50 2250 588 247 788 344 466 951 528 963 319 620 



50 



228 878 511 199 384 804 987 952 173 176 432 



9.8331, 



while 



50+9 



9.833 3. 
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